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RAW SEQUENCE LISTING DATE: 08/14/2000 

PATENT APPLICATION: US/09/463,705 'tIME: 15:56:17 

input Set : A:\146.1335.txt . rcX^^ A^i'^ 

Output Set: N:\CRF3\08142000\I4 63705.raw ^^6® 



3 <110> APPLICANT: Cortes, J 

4 Fromentin, C v^ftC'^^ 

5 Gaisser, S 

6 Leadlay/ P 

7 Mendez, C 

8 Michel, J-M 

9 Raynal, M-C 

10 Salah-Bey, K 

11 Salas, J ^ 

13 <120> TITLE OF IN\^ENTION: BIOSYNTHESIS GENES & TRANSFER OF 6 -DESOXY-HEXOSES IN ^ 

14 SACCHAROPOLYSPORA ERYTHRAEA AND IN STREPTOMYCES ^ 

15 ANTIBIOTICUS AND THEIR USE 
17 <130> FILE REFERENCE: 146.1335 



O 

19 <140> CURRENT APPLICATION NUMBER: 09/463705 CD 



20 <141> CURRENT FILING DATE: 2000-02-23 ^ CO 

K CO 



•-TV 

22 <150> PRIOR APPLICATION NUMBER: PCT/FR98/01593 rj* q-j pTj 



23 <151> PRIOR FILING DATE: 1998-07-21 

25 <150> PRIOR APPLICATION NUMBER: 98/07411 f>o 

26 <151> PRIOR FILING DATE: 1988-06-12 -^^ ^ t*l 

28 <150> PRIOR APPLICATION NUMBER: 97/09458 q ^ 

29 <151> PRIOR FILING DATE: 1997-07-25 O 
31 <160> NUMBER OF SEQ ID NOS : 61 

3 3 <170> SOFTWARE: Patent In Ver, 2.1 

ERRORED SEQUENCES ^ A I 3^ Or^a ^^'S /97C/?Jc^ 

2689 <210> SEQ ID NO: 21 f i ^ ^/ / P / 

2690 <211> LENGTH: 246 _ ^'^^ .r/ - ^ A ,^ &r O r// TfCi/oi^r^ SQ.9V^^<-^^ 

2691 <212> TYPE: PRT^ ^ <^n O S^^^ec ^ e £ 0/?/<^Oc^^ Gr^^T^ ^^^V 

error /'j /"ect^^^ ' 





<400> SEQUENCE: 


21 
























2695 


Met 


Arg 


Ala 


Asp 


Thr 


Glu 


Pro 


Thr 


Thr 


Gly 


Tyr 


Glu 


Asp 


Glu 


Phe 


Ala 


2696 


1 








5 










10 










15 




2698 


Glu 


He 


Tyr 


Asp 


Ala 


Val 


Tyr 


Arg 


Gly 


Arg 


Gly 


Lys 


Asp 


Tyr 


Ala 


Gly 


2699 








20 










25 










30 






2701 


Glu 


Ala 


Lys 


Asp 


Val 


Ala 


Asp 


Leu 


Val 


Arg 


Asp 


Arg 


Val 


Pro 


Asp 


Ala 


2702 






35 










40 










45 








2704 


Ser 


Ser 


Leu 


Leu 


Asp 


Val 


Ala 


Cys 


Gly 


Thr 


Gly 


Ala 


His 


Leu 


Arg 


His 


2705 




50 










55 










60 










2707 


Phe 


Ala 


Thr 


Leu 


Phe 


Asp 


Asp 


Ala 


Arg 


Gly 


Leu 


Glu 


Leu 


Ser 


Ala 


Ser 


2708 


65 










70 










75 










80 


2710 


Met 


Leu 


Asp 


He 


Ala 


Arg 


Ser 


Arg 


Met 


Pro 


Gly 


Val 


Pro 


Leu 


His 


Gin 


2711 










85 










90 










95 




2713 


Giy 


Asp 


Met 


Arg 


Ser 


Phe 


Asp 


Leu 


Gly 


Pro 


Arg 


Val 


Ser 


Ala 


Val 


Thr 


2714 








100 










105 










110 






2716 


Cys 


Met 


Phe 


Ser 


Ser 


Val 


Gly 


His 


Leu 


Ala 


Thr 


Thr 


Ala 


Glu 


Leu 


Asp 
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RAW SEQUENCE LISTING DATE: 08/14/2000 

PATENT APPLICATION: US/09/463,705 TIME: 15:56:17 

Input Set : A:\146.L33 5.txt 

Output Set: N:\CRF3\08142000\I463705.raw 

2717 115 120 125 

2719 Ala Th"!: Leu Arg Cys Phe Ala Arg His Thr Arg Pro Gly Gly Val Ala 

2720 130 135 140 X \' ^ (2 ^ 
2 721 He Glu Pro Trp Trp Phe Pro Glu Thr Phe Thr Asp Gly Tyr Val J J> ^ 

E— > 2722 145 150 155 

2724 Ala Gly Asp He Val Arg Val Asp Gly Arg Thr He Ser Arg Val^Ser /)GC ^ 

E--> 2725 165 170 175 ] J 

2727 His Ser Val Arg Asp Gly Gly Ala Thr Arg Met Glu He His Tyr Val ' 
E--> 2728 180 185 190 

2730 He Ala Asp Ala Glu His Gly Pro Arg His Leu Val Glu His His Arg 
E--> 2731 195 200 205 

2733 He Thr Leu Phe Pro Arg His Ala Tyr Thr Ala Ala Tyr Glu Lys Ala 
E--> 2734 210 215 220 

2 736 Gly Tyr Thr Val Glu Tyr Leu Asp Gly Gly Pro Ser Gly Arg Gly Leu 
E--> 2737 225 230 235 240 

2739 Phe Val Gly Thr Arg Thr 
E--> 2740 245 
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<400> 20 



35 



50 



65 



115 



130 




gac 
Asp 


a eg 
Thr 
5 


gag 
Glu 


ccg 
Pro 


ace 
Thr 


ace 
Thr 


ggg 

Gly 
10 


tac 
Tyr 


gag gac 
Glu Asp 


gag 
Glu 


ttc 
Phe 
15 


gee 
Ala 


48 


gac 
Asp 
20 


gee 
Ala 


gtg 
Val 


tac 
Tyr 


egg 
Arg 


gge 
Gly 
25 


egg 
Arg 


ggc 
Gly 


aag 
Lys 


gac 
Asp 


tac 
Tyr 
30 


gee 
Ala 


gge 
Gly 


96 


gac 
Asp 


gtg 
Val 


geg 
Ala 


gac 
Asp 


etc 
Leu 
40 


gtg 
Val 


cgc 
Arg 


gac 
Asp 


Arg 


cfta 
Val 
45 


ccg 
Pro 


gae 
Asp 


geg 
Ala 


144 


ctg gac 
Leu Asp 


gtg 
Val 


gee 
Ala 
55 


tgc 
Cys 


ggc 
Gly 


acg 
Thr 


ggc 
Gly 


aea 
Ala 
60 


cac 
His 


etg egg 
Leu Arg 


cac 
His" 


192 


etc 
Leu 


ttc gac 
Phe Asp 
70 


gae 
Asp 


gee 
Ala 


cgc 
Arg 


ggt 
Gly 


etc 
Leu 
75 


gaa 
Glu 


ctg 
Leu 


tec 
Ser 


geg 
Ala 


age 
Ser 
80 


240 


ate 
He 


gee 
Ala 
85 


cgc 
Arg 


tec 
Ser 


cgc 
Arg 


atg 
Met 


ccg 
Pro 
90 


ggc 
Gly 


Qta 
Val 


ccg 
Pro 


etg 
Leu 


cac 
His 
95 


caa 
Gin 


288 


cga 
Arg 
100 


tec 
Ser 


ttc 
Phe 


gac 
Asp 


etg 
Leu 


ggg 

Gly 
105 


cea 
Pro 


cgc 
Arg 


gtc 
Val 


tec 
Ser 


geg 
Ala 
110 


gtc 
Val 


acc 
Thr 


336 


age 
Ser 


tee 
Ser 


gtc 
Val 


ggc 
Gly 


cac 
His 
120 


ctg 
Leu 


gee 
Ala 


acc 
Thr 


acc 
Thr 


gee 
Ala 
125 


gaa 
Glu 


etc 
Leu 


gac 
Asp 


384 


egg 
Arg 


tgc 
Cys 


ttc 
Phe 


gee 
Ala 
135 


egg 
Arg 


cac 
His 


ace 
Thr 


egg 
Arg 


cec 
Pro 
140 


ggc 
Gly 


ggc gtg 
Gly Val 


gee 
Ala 


432 


ccg 
Pro 


tgg 
Trp 


tgg 
Trp 
150 


ttc 
Phe 


ccg 
Pro 


gag 
Glu 


acc 
Thr 


ttc 
Phe 
155 


ace 
Thr 


gac 
Asp 


ggc tac 
Gly Tyr 


gtg 
Val 
160 


480 




<210> 21 
<211> 246 
<212> PRT 
<213> 




<400> 21 



Met Arg Ala Asp Thr Glu Pro Thr Thr Gly Tyr Glu Asp Glu Phe Ala 
15 10 15 

Glu lie Tyr Asp Ala Val Tyr Arg Gly Arg Gly Lys Asp Tyr Ala Gly 
20 25 30 

Glu Ala Lys Asp Val Ala Asp Leu Val Arg Asp Arg Val Pro Asp Ala 
35 40 45 

Ser Ser Leu Leu Asp Val Ala Cys Gly Thr Gly Ala His Leu Arg His 
50 55 60 

Phe Ala Thr Leu Phe Asp Asp Ala Arg Gly Leu Glu Leu Ser Ala Ser 
65 70 75 80 

Met Leu Asp lie Ala Arg Ser Arg Met Pro Gly Val Pro Leu His Gin 
85 90 ' 95 

Gly Asp Met Arg Ser Phe Asp Leu Gly Pro Arg Val Ser Ala Val Thr 
100 105 110 

Cys Met Phe Ser Ser Val Gly His Leu Ala Thr Thr Ala Glu Leu Asp 
115 120 125 

Ala Thr Leu Arg Cys Phe Ala Arg His Thr Arg Pro Gly Gly Val Ala 

130 135 140 

lie Glu Pro Trp Trp Phe Pro Glu Thr Phe Thr Asp Gly Tyr Val 




160 
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VERIFICATION SUMMAEY 

PATENT APPLICATION: US/09/463,705 

input Set : A:\146.1335.txt 

Output Set: N:\CRF3\08142000\I463705.raw 



DATE: 08/14/2000 
TIME: 15:56:18 



L:42 M:3 
(1046) ) 
L:48 M:3 
(3404) ) 
L:173 M: 
L:243 M: 
L:324 M: 
L:37B M: 
L:603 M: 
L:607 M: 
L:710 M: 
L:714 M: 
L:718 M: 
L:722 M: 
L:726 M: 
L:730 M: 
L:734 M: 
L:738 M: 
L:742 M: 
L:746 M: 
L:750 M: 
L:754 M: 
L:758 M: 
L:762 M: 
766 M: 
772 M; 
776 M: 
780 M: 
784 M: 
788 M: 
792 M: 
L:796 M: 
L:800 M: 
L:804 M: 
L:808 M: 
L:812 



51 w': Sequence data Name/Key Feature Out- of -Range, 
51 W: Sequence data Name/Key Feature Out- of -Range, 



M: 



816 M: 
820 M: 
824 M: 
828 M: 
832 M: 
836 M: 
L:840 M: 
L:844 M: 
L:848 M: 
L:852 M: 
L:856 M: 
L: 860 M: 
L:864 M: 
L:868 M: 



201 W 

201 W 

351 W 

201 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 

336 W 



Mandatory field data missing, ORGANISM 
Mandatory field data missing, ORGANISM 
Sequence data Name/Key Feature Out-of -Range , 
Mandatory field data missing, ORGANISM 

Invalid Amino Acid Number in Coding Region, SEQrID:6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID; 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID; 6 

Invalid Amino Acid Number in Coding Region, SEQ ID; 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID; 6 

Invalid Amino Acid Number in Coding Region, SEQ ID; 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 

Invalid Amino Acid Number in Coding Region, SEQ ID: 6 



SEQ ID#:1, CDS LOCATION: Complement (( 48 ). . 
SEQ ID#:1, CDS LOCATION: Complement (( 2322 ). . 

SEQ ID#:4, CDS LOCATION: complement (4).. (12 66) 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/463,705 



DATE: 08/14/2000 
TIME: 15:56:18 



Input Set ; 
Output Set: 



A:\146.1335 .txt 

N:\CRF3\0 8142000\I463 705.raw 



L 


872 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 , 




L 


876 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 




L 


880 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 




L 


884 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 




L 


888 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 




L 


892 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 




L 


896 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 




L 


900 


M 


336 


W: 


Invalid 


Amino 


Acid 


Number 


in 


Coding 


Region, 


SEQ 


ID 


6 





L 


1122 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


1191 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


1276 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


1328 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


1430 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


1530 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


1696 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2269 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2353 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2441 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2529 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2692 


M 


201 


W; 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2722 


M 


332 


E: 


(32) Invalid/Missing 


Amino Acid Number 


M 


332 Repeated 


in SeqNo=21 








L 


2740 


M 


252 


E: 


No. of Seq. differs, 


<211>LENGTH: Input 


L 


2746 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2758 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2770 


M 


201 


W; 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2782 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2806 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2818 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2830 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


2842 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2854 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2866 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2878 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2890 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2902 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


2914 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2926 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2938 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2950 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2962 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


2974 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2986 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


2998 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


3010 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 


L 


3022 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


3036 


M 


201 


W: 


Mandatory 


field 


data 


missing, 


ORGANISM 


L 


3048 


M 


201 


W: 


Mandatory 


field 


data 


missing. 


ORGANISM 


L 


3060 


M 


201 


W: 


Mandatory 


field 


data 


missing , 


ORGANISM 



.ng, SEQ ID: 21 
246 Found;245 SEQ:21 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/463,705 



.DATE: 08/14/2000 
TIME: 15:56:18 



Input Set 
Output Set 



A:\146. 1335. txt 
N:\CRF3\08142000\I463705 .raw 



L: 


;3072 


M: 


201 


W: 


Mandatory 


field 


data missing, 


ORGANISM 


L: 


;3086 


M: 


201 


W: 


Mandatory 


field 


data missing, 


ORGANISM 


L; 


:3098 


M: 


201 


W: 


Mandatory 


field 


data missing. 


ORGANISM 


L: 


:3110 


M: 


201 


W: 


Mandatory 


field 


data missing. 


ORGANISM 


L: 


:3122 


M; 


201 


W: 


Mandatory 


field 


-data missing. 


ORGANISM 


L 


:3134 


M; 


;201 


W: 


Mandatory 


field 


data missing, 


ORGANISM 


L: 


:3146 


M: 


;201 


W: 


Mandatory 


field 


data missing. 


ORGANISM 


L: 


:3158 


M: 


:201 


W: 


Mandatory 


field 


data missing. 


ORGANISM 


L: 


:3170 


M: 


;201 


W: 


Mandatory 


field 


data missing. 


ORGANISM 
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